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Abstract — Rank modulation is a way of encoding information 
to correct errors in flash memory devices as well as impulse 
noise in transmission lines. Modeling rank modulation involves 
construction of packings of the space of permutations equipped 
with the Kendall tau distance. 

We present several general constructions of codes in permuta- 
tions that cover a broad range of code parameters. In particular, 
we show a number of ways in which conventional error-correcting 
codes can be modified to correct errors in the Kendall space. 
Codes that we construct afford simple encoding and decoding 
algorithms of essentially the same complexity as required to 
correct errors in the Hamming metric. For instance, from binary 
BCH codes we obtain codes correcting t Kendall errors in n 
memory cells that support the order of 7i!/(log 2 n!)' messages, 
for any constant t = 1,2,.... We also construct families of 
codes that correct a number of errors that grows with n at 
varying rates, from 6(n) to 9(n 2 ). One of our constructions 
gives rise to a family of rank modulation codes for which the 
trade-off between the number of messages and the number of 
correctable Kendall errors approaches the optimal scaling rate. 
Finally, we list a number of possibilities for constructing codes of 
finite length, and give examples of rank modulation codes with 
specific parameters. 

Index Terms — Flash memory, codes in permutations, rank 
modulation, transpositions, Kendall tau distance, Gray map 



I. Introduction 

Recently considerable attention in the literature was devoted 
to coding problems for non-volatile memory devices, including 
error correction in various models as well as data manage- 
ment in memories O, 0, lfT3l - |[T5l . Non-volatile memories, 
in particular flash memory devices, store data by injecting 
charges of varying levels in memory cells that form the device. 
The current technology supports multi-level cells with two or 
more charge levels. The write procedure into the memory is 
asymmetric in that it is possible to increase the charge of an 
individual cell, while to decrease the charge one must erase 
and overwrite a large block of cells using a mechanism called 
block erasure. This raises the issue of data management in 
memory, requiring data encoding for efficient rewriting of the 
data [12 |. A related issue concerns the reliability of the stored 
information which is affected by the drift of the charge of 
the cells caused by ageing devices or other reasons. Since 
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the drift in different cells may occur at different speed, errors 
introduced in the data are adequately accounted for by tracking 
the relative value of adjacent cells rather than the absolute 
values of cell charges. Storing information in relative values 
of the charges also simplifies the rewriting of the data because 
we do not need to reach any particular value of the charge as 
long as we have the desired ranking, thereby reducing the risk 
of overprogramming. Based on these ideas, Jiang et al. [14], 
lfT31l suggested to use the rank modulation scheme for error- 
correcting coding of data in flash memories. A similar noise 
model arises in transmission over channels subject to impulse 
noise that changes the value of the signal substantially but 
has less effect on the relative magnitude of the neighboring 
signals. In an earlier work devoted to modeling impulse noise, 
Chadwick and Kurz [6| introduced the same error model and 
considered coding problems for rank modulation. Drift of 
resistance in memory cells is also the main source of errors 
in multilevel-cell phase-change memories [22|. 

Motivated by the application to flash memories, we consider 
reliable storage of information in the rank modulation scheme. 
Relative ranks of cell charges in a block of n cells define a 
permutation on the set of n elements. Our problem therefore 
can be formulated as encoding of data into permutations so 
that it can be recovered from errors introduced by the drift 
(decrease) of the cell charges. 

To define the error process formally, let [n] — { 1 , 2 . . . , n} 
be a set of n elements and consider the set S n of permu- 
tations of [n]. In this paper we use a one-line notation for 
permutations: for instance (2,1,3) refers to the permutation 
(213)' R e f errm g to the discussion of charge levels of cells, 
permutation (2,1,3) means that the highest-charged cell is 
the second one followed by the first and then the third cell. 
Permutations can be multiplied by applying them successively 
to the set [n], namely the action of the permutation tt a, where 
7r,cr £ S n , results in z 1 — ^ a(it(i)),i — l,...,n. (Here and 
elsewhere we assume that permutations act on the right). Every 
permutation has an inverse, denoted cr -1 , and e denotes the 
identity permutation. 

Let er = (er(l), . . . , cr(n)) be a permutation of [n]. An 
elementary error occurs when the charge of cell j passes the 
level of the charge of the cell with rank one smaller than 
the rank of j. If the n-block is encoded into a permutation, cr, 
this error corresponds to the exchanging of the locations of the 
elements and cr(j + 1) in the permutation. For instance, 
let a = (3, 1, 4, 2) then the effect of the error -n = (2,1,3, 4) is 
to exchange the locations of the two highest-ranked elements, 
i.e., 7R7 = (1,3,4,2). 

Accordingly, define the Kendall tau distance d T (a, ir) from 
cr to another permutation tt as the minimum number of trans- 
positions of pairwise adjacent elements required to change 
a into 7r. Denote by X n — (S n ,d T ) the metric space of 
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permutations on n elements equipped with the distance d T . 
The Kendall metric was studied in statistics [16] where it was 
introduced as a measure of proximity of data samples, as well 
as in combinatorics and coding theory 0, iflOl . The Kendall 
metric also arises naturally as a Cayley metric on the group 
S n if one takes the adjacent transpositions as its generators. 

The Kendall distance is one of many metrics on per- 
mutations considered in the literature; see the survey (9). 
Coding for the Hamming metric was considered recently in 
following the observation in [25| that permutation arrays 
are useful for error correction in powerline communication. 
Papers ll20"l . 11231 . ll24l considered coding for the distance 
on permutations from the perspective of the rank modulation 
scheme. Generalizations of Gray codes for rank modulation 
are considered in f26l . while an application of LDPC codes 
to this scheme is proposed in ||27l . 

An (n, d) code C C X n is a set of permutations in S n such 
that the minimum distance d T separating any two of them 
is at least d. The main questions associated with the coding 
problem for the Kendall space X n are to establish the size 
of optimal codes that correct a given number of errors and, 
more importantly, to construct explicit coding schemes. In our 
previous work we addressed the first of these problems, 
analyzing both the finite-length and the asymptotic bounds on 
codes. Since the maximum value of the distance in X n is (™), 
this leaves a number of possibilities for the scaling rate of the 
distance for asymptotic analysis, ranging from d = 0(n) to 
d = Q(n 2 ). Define the rate of the code 



fl(C) = log|C|/log(n! 



(1) 



(all logarithms are base 2 unless otherwise mentioned) and let 

R(n, d) = max R(C) (2) 
<g{d) = lim R(n, d) (3) 



where the maximum in is over all codes with distance > d. 
We have the following result. 

Theorem 1: |3 1 The limit in exists, and 

!1 if d = 0{n) 

1-c ifd = 9(n 1+e ), 0<e<l (4) 
if d = G(n 2 ). 

Moreover, 

'0(log -1 ra) \fd = Q(n 2 ) 
l-0(log _1 n) if d=0{n). 



R(n, d) 



We remark [3| that the equality ^(d) = 1 — e holds under a 



slightly weaker condition, namely, d 



,1 + e 



a(n), where a(n) 



grows slower than any positive power of n. 

Equation suggests the following definition. Let us say 
that an infinite family of codes scales optimally if there exists 
e 6 (0,1) such that, for any positive a, (3, all codes of the 
family of length n larger than some no, have rate at least 
1 — e — /3 and minimum distance n(n 1+e ~ a ). 

The proof of Theorem [TJ relied on near-isometric embed- 
dings of X„ into other metric spaces that provide insights into 
the asymptotic size of codes. We also showed [3| that there 



exists a family of rank modulation codes that correct a constant 
number of errors and have size within a constant factor of the 
upper (sphere packing) bound. 

Regarding the problem of explicit constructions, apart from 
a construction in |[T5l of codes that correct one Kendall error, 
no other code families for the Kendall distance are presently 
known. Addressing this issue, we provide several general 
constructions of codes that cover a broad range of parameters 
in terms of the code length n and the number of correctable 
errors. We present constructions of rank modulation codes that 
correct a given number of errors as well as several asymptotic 
results that cover the entire range of possibilities for the scaling 
of the number of errors with the code's length. Sect. HI] we 
present a construction of low-rate rank modulation codes that 
form subcodes of Reed-Solomon codes, and can be decoded 
using their decoding algorithms. In Sect.|III]we present another 
construction that gives rank modulation codes capable of 
correcting errors whose multiplicity can be anywhere from 
a constant to 0(n 1+e ), < e < 1 /2, although the code rate is 
below the optimal rate of (gj. Relying on this construction, we 
also show that there exist sequences of rank modulation codes 
derived from binary codes whose parameters exhibit the same 
scaling rate as for any < e < 1. Moreover, we show 
that almost all linear binary codes can be used to construct 
rank modulation codes with this optimal trade-off. Finally, we 
present a third construction of rank modulation codes from 
codes in the Hamming space that correct a large number of 
errors. If the number of errors grows as 8(rt 2 ), then the rate of 
the codes obtained from binary codes using this construction 
attains the optimal scaling of 0(log _1 n). Generalizing this 
construction to start from nonbinary codes, we design families 
of rank modulation codes that scale optimally (in the sense of 
the above definition) for all values of e, < e < 1. 

Finally, Sect. [TV] contains some examples of codes obtained 
using the new constructions proposed here. 

Our constructions rely on codes that correct conven- 
tional (Hamming) errors, converting them into Kendall-error- 
correcting codes. For this reason, the proposed methods can be 
applied to most families of codes designed for the Hamming 
distance, thereby drawing on the rich variety of available con- 
structions with their simple encoding and decoding algorithms. 

II. Construction I: Rank modulation codes from 

PERMUTATION POLYNOMIALS 

Our first construction of rank modulation codes is alge- 
braic in nature. Let q — p rn for some prime p and let 
¥ q = {ao, a\, . . . , a q -i} be the finite field of q elements. A 
polynomial g{x) £ ¥ q [x\ is called a permutation polynomial 
if the values g(a) are distinct for distinct values of a € F, 
lfT9l Ch. 7]. 

Consider the evaluation map / M> (f(ao), . . . , f{a q -i)) 
which sends permutation polynomials to permutations of n 
elements. Evaluations of permutation polynomials of degree 
< k form a subset of a g-ary Reed-Solomon code of dimension 
k + 1. Reed-Solomon codes are a family of error-correcting 
codes in the Hamming space with a number of desirable 
properties including efficient decoding. For an introduction to 
them see |2I Ch. 10]. 
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At the same time, evaluating the size of a rank modulation 
code constructed in this way is a difficult problem because 
it is hard to compute the number of permutation polynomials 
of a given degree. In this section we formalize a strategy for 
constructing codes along these lines. This does not result in 
very good rank modulation codes; in fact, our later combina- 
torial constructions will be better in terms of the size of the 
codes with given error-correcting capabilities. Nonetheless, the 
construction involves some interesting observations which is 
why we decided to include it. 

A polynomial over ¥ q is called linearized of degree v if it 
has the form 



£{x) 



i=0 



Note that a linearized polynomial of degree v has degree p v 
when viewed as a standard polynomial. 

Lemma 2: The number of linearized polynomials over F g 
of degree less than or equal to v that are permutation polyno- 



mials in F q is at least 
1 



1 - 



1 



,H-i 



>q v - 



\ p - 1 q(p - 

Proof: The polynomial £(x) acts on ¥ q as a linear 
homomorphism. It is injective if and only if it has a trivial 
kernel, in other words if the only root of £(x) in F g is 0. 
Hence, £(x) is a permutation polynomial if and only if the 
only root of C(x) in ¥ q is 0. 

The total number of linearized polynomials of degree up to 
v is q v+1 . We are going to prove that at least a (1 — + 
j^zrn ) proportion of them are permutation polynomials. To 
show this, choose the coefficients <2j, < i < v of C(x) = 
2~2i=o a i xP uniformly and randomly from ¥ q . For a fixed a E 
¥*, the probability that £(a) = is 1/q, Furthermore, the set 
of roots of a linearized polynomial is an F p -vector space [21 
p. 119], hence the set of non-zero roots is a multiple of p — 1. 
The number of 1-dimensional subspaces of F g over F p is • 
The probability that one of these sets is included in the set of 
roots of C(x) is, from the union bound, 

g-1 1 

p- i q' 

Hence, the probability that C(x) is a permutation polynomial 
is greater than or equal to 1 — 1 ~ 



Pr(3a e F* : £(a) = 0) < 



9(p-i)- 



A. Code construction 



We use linearized permutation polynomials of ¥ q to con- 
struct codes in the space X n . Note that a linearized polynomial 
C(x) always maps zero to zero, so that when it is a permutation 
polynomial it can be considered to be a permutation of the 
elements of F q and also of the elements of F*. Let t be a 
positive integer and let v = Llog p (n — 2t — 1)J . Let Vt be the 
set of all linearized polynomials of degree < v that permute 
q - 1 and define the set A C F™ 



Fq. Set n 



A = {{£{a),ae¥Z), £ e V t } 



to be the set of vectors obtained by evaluating the polynomials 
in Vt at the points of F*. Form a code C T by writing the 



vectors in A as permutations (for that, we fix some bijection 
between [n] and F*, which will be implicit in the subsequent 
discussion). We can have n = q rather than n = q — 1 if 
desired: for that we add the zero field element in the first 
position of the (q — l)-tuples of A, and the construction below 
readily extends. 

The idea behind the construction is quite simple: the set A 
is a subset of a Reed-Solomon code that corrects t Hamming 
errors. Every Kendall error is a transposition, and as such, 
affects at most two coordinates of the codeword of C T . 
Therefore the code C T can correct up to t/2 errors. By handling 
Kendall errors more carefully, we can actually correct up to 
t errors. The main result of this part of our work is given by 
the following statement. 

Theorem 3: The code C T has length n = q — 1 and 
size at least gL 1 °s P (™~ 2t ^ 1 )J . it corrects all patterns of up 
to t Kendall errors in the rank modulation scheme under a 
decoding algorithm of complexity polynomial in n. 

Proof: It is clear that \C T \ — \A\, and from Lemma |2] 

\A\ > gHogpCn-at-l)^ 

Let a — (ai, a 2 , . . . , a i; a i+ i,. . . , a„), where a € ¥*, 1 < 
j < n, be a permutation in X n (with the implied bijection be- 
tween [n] and F*) and let a' = (ai, a 2 , . . . , a%+\-> °>i} ■ ■ ■ , a n) 
be a permutation obtained from a by one Kendall step (an 
adjacent transposition). We have 



(O,...,O,0, 



.,o) 



where 9 
Let 



H+l 



P 



G F, 

' 1 
1 
1 



V i i i 

be an n x n matrix. Note that 



\ 





1 / 



P(a-a'Y - (0,. .. ,0,0,0,. ..,0)\ 

This means that multiplication by the accumulator matrix P 
converts one adjacent transposition error into one Hamming 
error. Extending this observation, we claim that if d T (a, ir) < t 
with 7r being some permutation, and any t < 5, then the 
Hamming weight of the vector P(a — ir) T is not more than t. 
Here we again take a and tt to be vectors with elements from 
F* with the implied bijection between [n] and F*. 

Now let C(x) be a linearized permutation polynomial and 
let 1, a, a 2 , ... , ofi~ 2 be the elements of F* for some choice 
of the primitive element a. Let 

<j = (£(l),£(a),£(a 2 ),...,£(a q - 2 )). 

Since £(a + b) = £{a) + £(b), we have 

Pa T = (£(/? ),£(/?i),£(M . . . , £((3 q -2)) T 

where 



3=0 



0,1,.. 
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It is clear that /3, ^ 0, < i < n — 1 and also (3^ ^ (3i 2 
for < i\ < i% < n — 1, Therefore, the vector Pa T is a 
permutation of the elements of F* . At the same time, it is the 
evaluation vector of a polynomial of degree < n — 2t — 1. 
We conclude that the set {Pa T , u € A} is a subset of vectors 
of an (extended) Reed-Solomon code of length n, dimension 
n — 2t and distance 2t + 1. Any t errors in a codeword of such 
a code can be corrected by standard RS decoding algorithms 
in polynomial time. 

The following decoding algorithm of the code C T corrects 
any t Kendall errors. Suppose a € A is read off from memory 
as <j\. 

Decoding algorithm ( Construction I): 

• Evaluate z = Paf. 

» Use a Reed-Solomon decoding algorithm to correct up 
to t Hamming errors in the vector z, obtaining a vector 
y (if the Reed-Solomon decoder returns no results, the 
algorithm detects more than t errors). 

• Compute (7 = P~ x y T , i.e., 

Vi = yi+\-yi, l<i<n-l; <7 n = y n . 

The correctness of the algorithm follows from the construction. 
Namely, if d T (a, <j\) < t, then y corresponds to a transformed 
version of a, i.e., y = Pa T . Then the last step of the decoder 
correctly identifies the permutation a. ■ 

Some examples of code parameters arising from this theo- 
rem are given in Sect. [IV] 

We note an earlier use of permutation polynomials for 
constructing permutation codes in (7). At the same time, 
since the coding problem considered in that paper relies on 
the Hamming metric rather than the Kendall tau distance, its 
results have no immediate link to the above construction. 

III. Construction II: Rank modulation codes from 
the Gray map 

In this section we present constructions of rank modulation 
codes using a weight-preserving embedding of the Kendall 
space X n into a subset of integer vectors. To evaluate the error- 
correcting capability of the resulting codes, we further link 
codes over integers with codes correcting Hamming errors. 

A. From permutations to inversion vectors 

We begin with a description of basic properties of the 
distance d T such as its relation to the number of inversions 
in the permutation, and weight-preserving embeddings of S n 
into other metric spaces. Their proofs and a detailed discussion 
are found for instance in the books by Comtet [8| or Knuth 
ifTTl Sect. 5.1.1]. 

The distance d T is a right-invariant metric which means 
that d T {ui ) U2) — d T (p\o,oiG) for any a, ct\,02 <= S n 
where the operation is the usual multiplication of permutations. 
Therefore, we can define the weight of the permutation a as 
its distance to the identity permutation e = (1,2, ... ,n). 

Because of the invariance, the Cayley graph of S„ (i.e., 
the graph whose vertices are indexed by the permutations and 
whose edges connect permutations one Kendall step apart) is 



regular of degree n — 1. At the same time it is not distance- 
regular, and so the machinery of algebraic combinatorics does 
not apply to the analysis of the code structure. The diameter 
of the space X n equals TV = (™) and is realized by pairs of 
opposite permutations such as (1,2,3,4) and (4,3,2, 1). 

The main tool to study properties of d T is provided by 
the inversion vector of the permutation. An inversion in a 
permutation a E S n is a pair (i,j) such that i > j and 

cr_1 (i) > °" (*)• ft i s eas y to see tnat d T (a,e) — I{cr), the 
total number of inversions in a. Therefore, for any two permu- 
tations CTi,f72 we have d T [a\,ij2) = /(o^erf 1 ) — I(' T i' J 2 1 )- 
In other words, 

d T {a,n) = \{{i,j)E[n] 2 :i^j,ii- 1 (i)>iT- 1 {j), 

a-\i)<a-\ 3 )}\- 

To a permutation a G S n we associate an inversion vector 
x a e Q n = [0, 1] x [0,2] x ••• x [0,n - 1], where x a (i) = 
\{j € [n]:j < i + l,a-\j) > o-^i + l)}!, * = 1, . . . , n- 1. 
In words, x a (i),i = 1, ... ,n — 1 is the number of inversions 
in a in which i + 1 is the first element. For instance, we have 
a x a 
216437598 10103101 
It is well known that the mapping from permutations to the 
space of inversion vectors is bijective, and any permutation 
can be easily reconstructed from its inversion vectoiQ. Clearly, 

n-l 

I{a) = J2 x °(i)- (5) 

i=l 

Denote by J : Q n —> S n the inverse map from Q n to S n , 
so that J(x a ) = a. The correspondence between inversion 
vectors and permutations was used in lfT31 to construct rank 
modulation codes that correct one error. 

For the type of errors that we consider below we introduce 
the following l\ distance function on Q n : 

n-l 

di(x,y) = - y(i)\, {x,y€G„) (6) 

where the computations are performed over the integers, and 
write 1 1 a; 1 1 for the corresponding weight function (this is not a 
properly defined norm because Q n is not a linear space). Recall 
that d T (a,Tv) = I(7T(t _1 ); hence the relevance of the l\ dis- 
tance for our problem. For instance, let a% = (2, 1, 4, 3), 02 = 
(2, 3, 4, 1), then x ai = (1, 0, 1), x a2 — (1, 1, 1). To compute 
the distance d T (<7i,<72) we note that erf 1 = <j\ and so 

7(<7 2 c7r 1 )=/((l ) 4,3,2)) = ||(0,l,2)||=3. 

Observe that the mapping a — > x a is a weight-preserving 
bijection between X n and the set Q n . At the same time, 
the above example shows that this mapping is not distance 
preserving. Indeed, d T (cr\,G2) — 3 while d\ (x rTl , x a2 ) = 1. 
However, a weaker property pointed out in lfT31 is true, 
namely: 

1 There is more than one way to count inversions and to define the inversion 
vector: for instance, one can define x a (i) — \{j : j < i,cr(j) > a(i + 
= 1, ... ,n - 1. In this case, given a = (2, 1,6,4, 3,7, 5, 9,8) we 
would have x a = (1,0,2,1,2,0,0,1). The definition in the main text is 
better suited to our needs in that it supports Lemma f4] below. 



5 



Lemma 4: Let o\ , a 2 € S n , then 

d T (ai,a 2 ) > di(x ail x < j 2 ). 



(7) 



Proof: Let a(m), a(m + 1) be two adjacent elements in 
a permutation a. Let i = a{m),j = a(m + 1) and suppose 
that i < j. Form a permutation a' which is the same as a 
except that cr'(m) = j, a'(m + 1) = i, so that d r (er, a') = 1. 
The count of inversions for which i is the first element is 
unchanged, while the same for j has increased by one. We 
then have x a >(k) — x a (k),k ^ j and av(j) — x a (j) + 1. 
Thus, di(x a r,x,j) — 1. and the same conclusion is clearly 
true if i > j. 

Hence, if the Kendall distance between u\ and <j 2 is 1 then 
the l\ distance between the corresponding inversion vectors 
is also 1. Now consider two graphs G and G" with the same 
vertex set S n . In G there will be an edge between two vertices 
if and only if the Kendall distance between them is 1. On the 
other hand there will an edge between two vertices in G" if 
and only if the l\ distance between corresponding inversion 
vectors is 1. We have just shown that the set of edges of G 
is a subset of the set of edges of G". The Kendall distance 
between two permutations is the minimum distance between 
them in the graph G. A similar statement is true for the i\ 
distance with the graph G". 

This proves the lemma. ■ 

We conclude as follows. 

Proposition 5: If there exists a code C in Q n with l\ 
distance d then the set C T := {J{x) : x 6 C} forms a rank 
modulation code in S n of cardinality \C\ with Kendall distance 
at least d. 



Note the "reflective" nature of the map: the last 2 bits of the 
second block of four are a reflection of the last 2 digits of the 
first block with respect to the horizontal line; the last 3 bits 
of the second block of eight follow a similar rule, and so on. 
This property, easy to prove from (JHJ, will be put to use below 
(see Prop. [9). 



Now, for i = 2, . . . , n, let 



and let 



t/ji : {0,l} m * -> [0,i-l] 



be the inverse Gray map = . Clearly if>i is well defined; 
it is injective but not surjective for most i's since the size of 
its domain is only 2" li . 

Proposition 6: Suppose that x, y 6 {0, l} mi . Then 



\ipi(x) - ipi(y)\ > d H (x,y), 



(9) 



where dn denotes the Hamming distance. 

Proof: This follows from the fact that if u, v are two 
integers such that \u — v\ = 1, then their Gray images satisfy 
dH{4>(u), <fi(v)) = 1. If the number are such that u < v and 
| it — u| = d, then by the triangle inequality, 

d H (<t>(u),4>(v)) < d H (<p(u), (j)(u + 1)) 

+ --- + d H {<p(v-l),4>(v)) 

= d 



B. From inversion vectors to the Hamming space via Gray 
Map 

We will need the Gray map which is a mapping <fi s from the 
ordered set of integers [0, 2 s — 1] to {0, 1} S with the property 
that the images of two successive integers differ in exactly 
one bit. Suppose that fe s -i^s-2 • • • &o> bi € {0, 1}, < i < s, 
is the binary representation of an integer u 6 [0, 2 s — 1]. Set 
by definition b s = and define tj> 3 (u) = (g s -i,g s - 2 , ■ ■ -,9o), 
where 



9j = (bj+b j+ i) (mod 2) (j = 0,l,. 



1) 



(8) 



(note that for s > 4 there are several ways of defining maps 
from integers to binary vectors with the required property). 



Example: 
follows: 



The Gray map for the first 10 integers looks as 



000 
000 
000 
000 



00000 
00001 
000 1 
000 1 1 



0000 
0000 
0000 
0000 



0000 
0001 
00 11 
0010 



000 
000 
000 
000 



00100 
00101 
00110 
00111 



0000 
0000 
0000 
0000 



0110 
111 
10 1 
100 



000 
000 



01000 
01001 



0000 
0000 



1100 
110 1 



Consider a vector x = (xa|x3| . . . \x n ), where Xi € 
{0, l} mi , i — 2,...,n. The dimension of x equals m = 
532=2 mi ~ l°g n ': or more precisely 

m n — l 

m = ( 2J+1 - 23 ) 3 + m " + 1 - 2 m " ) 

771 n — 1 

= ]T jV +m„(n + l-2 m ") 
i=i 

= {m n -2)2 m " +2 + m„(n + l-2 m ") 
= (n + l)m„ - 2 m " +1 + 2. 

On the first line of this calculation we used the fact that among 
the numbers mi there are exactly 2 3+1 — 2 3 numbers equal to j 
for all j < n-l, namely those with i = 2 J ,2- J + 1, . . . ,2 3+1 - 
1. The remaining (n + 1) — 2 m " numbers equal m n . 
For a vector x E {0, l} m let 

*(a;) = ty(x 2 \x 3 \ . . . \x n ) = (ip 2 (x 2 ), ■ ■ . ,ip n (x n )). 



Proposition 7: Let x,y £ {0, l} m . Then 
di > d H (x,y), 
where the distance d\ is the t\ distance defined in (0. 
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Proof: Using (0, we obtain 

n 

2=2 

n 

> ^djj^yj 
= d(x,y). 



C. The code construction: correcting up to 0(n log n) number 
of errors 

Now we can formulate a general method to construct rank 
modulation codes. We begin with a binary code A of length 
to and cardinality M in the Hamming space. 

Encoding algorithm ( Construction II): 

• Given a message m encode it with the code A. We obtain 
a vector x £ {0, l} m . 

• Write x = (a^l^l . . . \x n ), where Xj G {0, l} mi . 
m Evaluate n = J(^f(x)) 

This algorithm is of essentially the same complexity as the 
encoding of the code A, and if this latter code is linear, 
is easy to implement, Both J and are injective, so the 
cardinality of the resulting code is M. Moreover, each of 
the two mappings can only increase the distance (namely, see 
(O and the previous Proposition). Summarizing, we have the 
following statement. 

Theorem 8: Let A be a binary code of length 

m = (n + l)LlognJ _ 2 L lo s"J+ 1 + 2, 

cardinality M and Hamming distance d. Then the set of 
permutations 

C T = {it e S n : 7T = J(*(x)), x e A} 

forms a rank modulation code on n elements of cardinality M 
with distance at least d in the Kendall space X n . 

The resulting rank modulation code C T can be decoded to 
correct any t = — 1) /2j Kendall errors if t Hamming errors 
are correctable with a decoding algorithm of the binary code 
A. Namely, suppose that a' is the permutation that represents 
a corrupted memory state. To recover the data we perform the 
following steps. 

Decoding algorithm ( Construction II): 

• Construct the inversion vector av • Form a new inversion 
vector y as follows. For i = 2, . . . ,n, if x„t(i — 1) € 
[0, i- 1] is greater than 2 rn ' - 1 then put y a , (i) = 2 m ' - 1, 
else put y a ,(i) = x„r(i). 

• Form a vector y G {0, l} m , y = {y 2 \y^ \ ■ ■ ■ \u n ) where 
y, € {0, l} mi is given by cj>i(y a , (i)). 

• Apply the t-error-correcting decoding algorithm of the 
code A to y. If the decoder returns no result, the 
algorithm detects more than t errors. Otherwise suppose 
that y is decoded as x. 

. Output a = J(Sb(x)). 



The correctness of this algorithm is justified as follows. 
Suppose a E C T is the original permutation written into the 
memory, and d T (cr, a') < t. Let x a be its inversion vector and 
let x be its Gray image, i.e., a vector such that ^(x) = x a . 
By Lemma|4]and Prop.|7]we conclude that dn{x, y) < t, and 
therefore the decoder of the code A correctly recovers x from 
y. Therefore a' will be decoded to a as desired. 

Example: Consider a t-error-correcting primitive BCH 
code A in the binary Hamming space of length to = (n + 
l)LlognJ - 2L 1 °g"J+ 1 + 2 and designed distance 2t + 1 
(generally, we will need to shorten the code to get to the 
desired length rri). The cardinality of the code satisfies 

2 m 



The previous theorem shows that we can construct a set of 
(n, M) rank modulation codes that correct t Kendall errors. 
Note that, by the sphere packing bound, the size of any code 
C € X n that corrects t Kendall errors satisfies \C\ = 0(n!/n*). 
The rank modulation codes constructed from binary BCH 
codes have size M = f2(n!/(logn!) 4 ) = 0(n!/(n* log* n)). 

Specific examples of code parameters that can be obtained 
from the above construction are given in Sect. HVl 

Remark (Encoding into permutations): Suppose that the 
construction in this section is used to encode binary messages 
into permutations (i.e., the code A in the above encoding algo- 
rithm is an identity map). We obtain an encoding procedure of 
binary m-bit messages into permutations of n symbols. This 
redundancy of this encoding equals 1 — to/ log(n!). Using the 
Stirling formula, we have for n > 1 

log rt! < log(v / 2~7m) + nlogn — (n — ") loge 

( H], Eq. 6.1.38). Writing m > (n + l)(logn - l)-2n+ 2, 
we can estimate the redundancy as 

to (3-loge)n 

1-1 r< ; ; , n>2. 

log n! log n! 

Thus the encoding is asymptotically nonredundant. The redun- 
dancy is the largest when n is a power of 2. It is less than 
10% for all n > 69, less than 7% for all n > 527, etc. 

D. Correcting 0(n 1+e ) number of errors, < e < x ji 

Consider now the case when the number of errors t grows 
with n. Since the binary codes constructed above are of length 
about n log n, we can obtain rank modulation codes in X n that 
correct error patterns of Kendall weight t = Q(nlogn). But 
in fact more is true. We need the following proposition. 

Proposition 9: Let x,y £ {0, l} m . Then 

4 (*(x),*(y))>Iizi(2^-l). 

Proof: Assume without loss of generality that x ^ y. We 
first claim that, for any such x,y £ {0, l} m % the inequality 
d H (x,y) >Wi>l implies that \ipi(x) - ipi(y)\ > 2 Wi ~ l . 
This is true because of the reflection property of the standard 
Gray map as exemplified above. 
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Now consider vectors x = (a^xa] . . . \x n ),y = 
(y 2 \y 3 \ . . . \y n ) in {0, 1}™ where x llVl e {0, 1}™% 2 < i < 
n. Suppose that dn{xi, y^) = Wi for all i, and Y%=2 Wi = w 
where w = dff(x,y). 

Hence, 

di(*(x),*(»)) = X;i^(a'i)-^(l/i)l 

i=2 

Wi >0 

" 1 

-E**- 1 - E 5 

i— 2 i: Wi— 

We do not have control over the number of nonzero Wi's, so 
let us take the worst case. We have 

" 1 

V 2 W ' > 2^*= 2 = 2T^T. 

^— ' n - 1 

i=2 

As for 53i-«; =oS' use trie trivial upper bound (n — l)/2. 
Together the last two results conclude the proof. ■ 
We have the following theorem as a result. 

Theorem 10: Let C and C T be the binary and rank modula- 
tion codes defined in Theorem[8] Suppose furthermore that the 
minimum Hamming distance d of the code C satisfies d = em, 
where to is the blocklength of C. Then the minimum Kendall 
distance of the code C T is J7(n 1+e ). 

Proof: We have logn — 1 < [log rzj < logn. Use this in 
the definition of m to obtain that m > n(logn — 3). Therefore, 
d = em > en(logn — 3). From the previous proposition the 
minimum Kendall distance of C T is at least 

" ~ 1 ( 2 6n(logn-3)/(n-l) _ ^ = Q( n l+e). 

■ 

Examples of specific codes that can be constructed from 
this theorem are again deferred to Sect. [IV] 

Let us analyze the asymptotic trade-off between the rate 
and the distance of the codes. We begin with an asymptoti- 
cally good family of binary codes, i.e., a sequence of codes 
Ci,i — 1,2..., of increasing length m for which the rate 
log \ Ci\/m converges to a positive number R, and the relative 
Hamming distance behaves as em, where < e < 1 /2. Such 
families of codes can be efficiently constructed by means of 
concatenating several short codes into a longer binary code 
(e.g., lETl Ch. 10]) Using this family in the previous theorem, 
we obtain a family of rank modulation codes in S n of Kendall 
distance that behaves as 57(n 1+e ), and of rate R (see (HJ). The 
upper limit of 1/2 on e is due to the fact |2~T1 p. 565] that no 
binary codes of large size (of positive rate) are capable of 
correcting a higher proportion of errors. 

E. Correcting even more, 0(n 1+e ), errors, 1 /2 < e < 1 

It is nevertheless possible to extend the above theorem to 
the case of e > 1 /2, obtaining rank modulation codes of 
distance fl(n 1+e ), 1/2 < e < 1 and positive rate. However, 
this extension is not direct, and results in an existential claim 
as opposed to the constructive results above. To be precise, one 



can show that for any < e < 1, there exist infinite families 
of binary (m, M, d) codes C, with rate R = 1 — e, such that 
the associated rank modulation code C T for permutations of 
[n] in Theorem [8] has minimum Kendall distance fl(n 1+£ ). 

Theorem 11: For any < e < 1, there exist infinite families 
of binary (m, M) codes C such that (1/m) log M — > 1 — e > 

0, and the associated rank modulation code C T constructed 
in Theorem [5] has minimum Kendall distance that scales as 
il(n 1+e ). Moreover all but an exponentially decaying fraction 
of the binary linear codes are such. 

The rank modulation codes described above have asymptot- 
ically optimal trade-off between the rate and the distance. 
Therefore, this family of codes achieves the capacity of rank 
modulation codes (see J3] Thm. 3.1]). 

To prove the above theorem we need the help of the 
following lemma. 

Lemma 12: Let < a < 1 and let T C [m], \T\ > am 
be a coordinate subset. There exists a binary code C of length 
m and any rate R < a such that the projections of any two 
codewords x,yGC,x^y on T are distinct. Moreover all 
but an exponentially decaying fraction of binary linear codes 
of any rate less than a are such. 

Proof: The proof is a standard application of the proba- 
bilistic method. Construct a random binary code C of length 
m and size M = 2 mR randomly and independently selecting 
M vectors from {0, l}" 1 with uniform probability. Denote by 
£ x y the event that two different vectors x,y g C agree on T. 
Clearly Pr^y) = 2~ am , for all x,y e C. The event £ x , v 
is dependent on at most 2(M — 1) other such events. Using 
the Lovasz Local Lemma [2], all such events can be avoided, 

1. e., 

Pr ( f| > °< 

x,yeC 

if 

e2- am (2M - 1) < 1 

or 

M < 2 mn - 1 /e + l/2. 

Hence as long as R < a, there exists a code of rate R that 
contains no pairs of vectors x, y that agree on T. This proves 
the first part of the lemma. 

To prove the claim regarding random linear codes chose a 
linear code C spanned by the rows of an mR x m binary matrix 
G each entry of which is chosen independently with P(0) = 
-P(l) = V 2 - The code C will not contain two codewords that 
project identically on T if the mR x \T\ submatrix of G with 
columns indexed by T has full rank. If mR < |T| then a 
given mR x |T| sub-matrix of G has full rank with probability 
at least 1 - 5 • 2-^~ mR ^' [11]. Thus if |T| grows at least 
as T = mR + \fm, the proportion of matrices G in which 
the (mR x T) submatrix is singular falls exponentially with 
m. Even if each of these matrices generates a different code, 
the proportion of undesirable codes will decline exponentially 
with to. ■ 
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Proof of Thm. [77} Suppose that x, y E {0, l} m where 
to = Y^h=2 m « an< ^ m « = U°g*J as above in this section. Let 

n 

di(*(x)Mv)) = E l^te) " ^ ™ 1+e 

for some < e < 1. Let < £ < 1. For at least a 1 - (3 
proportion of indices i we can claim that 



Compute the vector x = i3(b), where i? : {0, 1}" 1 
Q n is as follows: 



(n-1) 



On the other hand, if cCj and y i have the same value in 
the first ti of the rrii coordinates, then the construction of the 
Gray map implies that \t/ji(xi) ~ ipi(y i )\ > 2 nii ~ ti . Hence for 
at least a 1 — /3 fraction of the i's, 

2 m«-ti < 



P(n-iy 

i.e., ti > to, -elogn-log p^-i) - 

Therefore, x and y must coincide in a well-defined subset 
of coordinates of size 

r(i-/3)(„-i)] r(i-/3)(n-i)i 

E *» - E ( mi ~ elogn - log 

i=2 i=2 

r(l-/3)(r.-X)l 



/9(n-l) 



- e(l - /3)(n - 1) logn - O(n) 
= to(1 — e — 0(l/logn)). 

Invoking Lemma [T2] now concludes the proof: indeed, it 
implies that there exists a binary code of rate at least 1 — e 
where no such pair of vectors x and y exists. The claim about 
linear codes also follows immediately. ■ 

F. Construction III: A quantization map 

In this section we describe another construction of rank 
modulation codes from codes in the Hamming space over an 
alphabet of size q > 2. The focus of this construction is on the 
case when the number of errors is large, for instance, forms a 
proportion of n 2 . 

The first result in this section serves as a warm-up for a more 
involved construction given later. In the first construction we 
use binary codes in a rather simple manner to obtain codes in 
permutations. This nevertheless gives codes in X n that correct 
a large number of errors. Then we generalize the construction 
by using codes over larger alphabets. 

1) Construction IIIA: Rank modulation codes from binary 
base codes: Recall our notation Q n for the space of inversion 
vectors and the map J : Q n — > S n that sends them to 
permutations. Let C E {0, be a binary code that encodes 
k bits into n — 1 bits. 

Encoding algorithm (Construction IIIA): 

• Let m E {0, l} fe be a message. Find its encoding b with 
the code C. 



i,b 2 ,...,b n -x) i y x = (xi, . . . ,x n -x) 

jo if &i = 
I i if 6,- = 1 



i = 1, . . . , n — 1 
Find the encoding of m as a = J(x). 



Theorem 13: Let C(n — l,M,d > 2t + 1) be a code in 
the binary Hamming space and let C T C S n be the set 
of permutations obtained from it using the above encoding 
algorithm. Then the code C T C S n has cardinality M and 
corrects any r Kendall errors where r — t 2 /4 if t > 2 is even 
and r = (t 2 - l)/4 if t > 3 is odd. 

Proof: To prove the claim about error correction, consider 
the following decoding procedure of the code C T . Let it be a 
permutation read off from memory. 

Decoding algorithm (Construction IIIA): 

m Find the inversion vector x n — (xi, . . . , x n -i)- 
• Form a vector y£{0,l} n_1 by putting 



Vi = 



if Xi < LV2J 

1 if x, > [ l /2j . 



• Decode y with the code C to obtain a code vector c. If 
the decoder returns no result, the algorithm detects more 
than t errors. 

• Compute the overall decoding result as J(i?(c)). 

Let a be the original permutation, let x a be its inversion 
vector, and let c(a) be the corresponding codeword of C. The 
above decoding can go wrong only if the Hamming distance 
dn(c(a),y) > t. For this to happen the l\ distance between 
x v and x a must be large, in the worst case satisfying the 
condition di(x w ,Xa-) > X^=iLV2j- This gives the claimed 
result. ■ 

From a binary code in Hamming space of rate R that 
corrects any rn errors, the above construction produces a rank 
modulation code C T of size 2 Rn that is able to correct il(n 2 ) 
errors. The rate of the obtained code equals w i?(logn) _1 . 
According to Theorem [TJ this scaling is optimal for the 
multiplicity of errors considered. Some numerical examples 
are given in Sect. HV1 

2) Construction IIIB: Rank modulation codes from nonbi- 
nary codes: This construction can be further generalized to 
obtain codes that are able to correct a wide range of Kendall 
errors by observing that the quantization map employed above 
is a rather coarse tool which can be refined if we rely on codes 
in the g-ary Hamming space for q > 2. As a result, for any 
e < 1 we will be able to construct families of rank modulation 
codes of rate R = R(e) > that correct il(n 1+e ) Kendall 
errors. 

Let / > 1 be an integer. Let Q = {ai, a,2, ... , a q } be the 
code alphabet. Consider a code C of length n' = 2(l — l)(q—l) 
over Q and assume that it corrects any t Hamming errors 
(i.e., its minimum Hamming distance is at least 2t + 1). Let 
n = (21 + l)(g— 1). Consider the mapping <d q : — > G n , 
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defined as 0,(6) = (tfifa), tf 2 (6 2 ), • ■ • , 0„-i(&n-i)), & 
& n _i) e Q n - X , where 



i?i(aj) 



fO if i < 3(g- 1) 

(2fc — — 1) if (2k- i)(q- 1) <i 
< (2k+l)(q- 1) 

fe = 2, 3, . . . , (, 

j = l,2,3,... )(? . 



To construct a rank modulation code C T from the code C 
we perform the following steps. 

Encoding algorithm (Construction IIIB): 

• Encode the message m into a codeword c G C 

• Prepend the vector c with 3(q — 1) — 1 symbols a±. 

• Map the obtained (n — l)-dimensional vector to 5„ using 
the map J o Q q . 

The properties of this construction are summarized in the 
following statement. 

Theorem 14: Let ri = 2(1 - l)(q - l),n = (21 + l)(q - 
1),; > 2. Let C(n',M,d = 2t + 1) be a code in the q- 
ary Hamming space. Then the code C T C S n described by 
the above construction has cardinality M and corrects any r 
Kendall errors, where 

r = (t + l-(q-l)s)(s + l)-l 

ands= L(*+l)/(2(g-l))J,a>0. 

Proof: We generalize the proof of the previous theorem. 
Let 7r be the permutation read off from the memory. 



Decoding algorithm (Construction IIIB): 

• Find the inversion vector = (x\ , . . . 

• Form a g-ary vector y by putting 



) — 1 ) 



a\ if i < 3(q — 1) 

a,j if (2k- l)(q- 1) < i < (2k+l)(q- 1) 
and (2fc - l)(j - 1) - (fc - 1) < x 4 

< (2fe-l)(?-l)+fe, 
fc = 2,3 . . . , Z 



for i = 1, . . . ,n — 1. 

• Decode y' = (2/3(5-1), • • • ,2/n-i) with the code C to 
obtain a codevector c. If the decoder returns no results, 
the algorithm detects more than t errors. 

• Find the decoded permutation as a = J(Q q (c)). 

There will be an error in decoding only when y' contains at 
least t + 1 Hamming errors, y' contains coordinates 3(q — 1) 
to n — 1 of y. Suppose that tj, 1 < j < I — 1 is the number of 
Hamming errors in coordinates between (2j + l)(q — 1) and 

(2j + 3)0-1). We have Ej-=i * j > *+! and *j < 2 (9-!)- 
The l\ distance between the received and original inversion 



vectors equals 

z-i z-i 
> 7't.; > min > 

^ J J " f,<2( 9 -l) J 

J J2 J t J >t+i 3 

= 2(g-l)(l + 2 + --- + s) 

+ (t + l-2(g-l)s)(s + l) 

= (q - l)s(s + 1) + (t + 1 - 2(q - l)s)(s + 1) 

= (t + l-(q-l)s)(s + l). 

In estimating the minimum in the above calculation we have 
used the fact that the smaller-indexed tj 's should be given the 
maximum value before the higher-indexed ones are used. 

Therefore if the l\ distance between the received and 
original inversion vectors is less than or equal to r then 
decoding y' with the code C will recover x a . Using (|7]) we 
complete the proof. ■ 

Asymptotic analysis: For large values of the parameters we 
obtain that the number of errors correctable by C T is 

~ f- 
r ~4q 

or, in other words, d(C T ) ~ d 2 /8q. In particular, if d — n'S and 
q = 0(n 1 - t ), < e < 1, then we getd(C T ) = fl(n 1+t ). If the 
code C has cardinality q Rn ' then |C T | = q Rn ' = g «(»-3(«-i)). 
Using ([T]l yields the value (1 — e)R for the rate of the code 
C T . This is only by a factor of R less than the optimal scaling 
rate of @. To achieve the optimal asymptotic rate-distance 
trade-off one need to use a q-ary code of rate very close to 
one and non-vanishing relative distance; moreover q needs to 
grow with code length n as n 1_e . 

To show an example, let us take the family of linear codes 
on Hermitian curves (see e.g., ||4] Ch. 10]). The codes can be 
constructed over any alphabet of size q = b 2 , where b is a 
prime power. Let u be an integer, b + l<u<b 2 — b+1. The 
length n', dimension fc and Hamming distance d of the codes 
are as follows: 

n' = b 3 + l, k = (b+l)u-(l/2)b(b-l)+l, d > n'-(b+l)u. 

In the next section we will give a few examples of codes 
with specific parameters. For the moment, let us look at the 
scaling order of R and r as functions of the length of the codes 
C T obtained from the above arguments. We have n w qb, so 
q w n 2 / 3 , and 

k (b +l)u- (1/2)6(6-1) + ! 



R 



> 



6 3 + l 
b 3 - (b+l)u 



Let us choose u = b 2 /2, which gives R rj and 6 ~ ^a, 
where a = 1 — 0(l/b). Finally, we obtain that the rate of the 
codes CV behaves as 



log q 



Rn' 



-R(l-o(l)) 



log n! 

and the number of correctable Kendall errors is r ss 
(l/64)n 4 / 3 , which gives the scaling order mentioned in the 
previous paragraph for e = 1/3. 
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By taking u — 6 1+7 , for < 7 < 1, and by shortening the 
Hermitian code to the length \(b + l)u, for A > 1 arbitrarily 
close to 1 we obtain a code with rate arbitrarily close to 
1 with relative minimum distance equal to 1 — 1/A. This 
yields asymptotically optimal scaling, in the sense defined in 
section H] for values of e that range in the interval (0, 1/3). For 
values of e in the range (1/3, 1), families of codes with optimal 
scaling can similarly be constructed by starting from Algebraic 
Geometry codes with lengths that exceed larger powers of 
q than q 3 ^ 2 , for instance, codes from the Garcia-Stichtenoth 
curves or other curves with a large number of rational points. 

Another general example can be derived from the family 
of quadratic residue (QR) codes ||2T|| . Let p be a prime, then 
there exist QR codes over F^ of length n' = p, cardinality 
M = £(p +1 )/ 2 and distance > ^fp, where I is a prime that 
is a quadratic residue modulo p. Using them in Theorem [14] 
(after an appropriate shortening), we obtain rank modulation 
codes in S n , where n = p + 3(£ — 1), with cardinality M 
and distance d(C T ) = Q(p/£). Let us take a sufficiently large 
prime p and let I be a prime and a quadratic residue modulo 
p. Suppose that I = Q(pz~ a ) for some small a > 0. Pairs of 
primes with the needed properties can be shown to exist under 
the assumption that the generalized Riemann hypothesis is true 
(see e.g. lfl8l ). Using the corresponding QR code C in Theorem 
M we obtain n =p + 3(£-l) = G(p), d(C T ) = Q(n? +a ) 
and logM = 6(§(| — a)logn), giving the rate |(| — a). 
Although this trade-off does not achieve the scaling order of 
©, it still accounts for a good asymptotic family of codes. 

IV. Examples 

Below C T refers to the rank modulation code that we are 
constructing, M = \C T \, and t is the number of Kendall 
errors that it corrects. We write the code parameters as a 
triple (n, logM, d) where d = 2t + 1. In the examples we 
do not attempt to optimize the parameters of rank modulation 
codes; rather, our goal is to show that there is a large 
variety of constructions that can be adapted to the needs of 
concrete applications. More codes can be constructed from 
the codes obtained below by using standard operations such as 
shortening or lengthening of codes [3 1, [ 15 1. Note also that the 
design distance of rank modulation codes constructed below 
may be smaller than their true distance, so all the values of the 
distance given below are lower estimates of the actual values. 

From Theorem [3] we obtain codes with the following 
parameters. Let q — 2 l , then n = q — 1 and log M > 
l[\og(q — 2t — 2)J. For instance, let I = 6, then we ob- 
tain the triples (63, 30, 31), (63, 24, 47), etc. Taking I = 8, 
we obtain for instance the following sets of parameters: 
(255, 56, 127), (255, 48, 191). 

Better codes are constructed using Theorem [8] Let us take 
n = 62, then m — 253. Taking twice shortened BCH codes 
Bt of length rn, we obtain a range of rank modulation codes 
according to the designed distance of Bt- In particular, there 
are rank modulation codes in Xg2 with the parameters 

(62, 253 - St, 2t + 1), t = 1, 2, 3, 

Similarly, taking n = 105, we can construct a suite of rank 
modulation codes from shortened BCH codes of length m = 



510, obtaining codes C T with the parameters 

(105, 510 - M, 2t + 1), t = 1, 2, 3, . . . . 

We remark that for the case of t = 1 better codes were 
constructed in [15|. Namely, there exist single-error-correcting 
codes in S n of size M > n\/(2n). For instance, for n = 62 
this gives M = 2 277 064 as opposed to our M = 2 245 . A 
Hamming-type upper bound on M has the form 

M(t) < 



where 



K n (0) 
K n {l) 
K n {2) 

K n (3) 



1 

n — 1 

(n 2 -n-2)/2 
n + 1 N 
3 



n 



(see e.g., (T7J p. 15] which also gives a general formula for 
K n (i) for i < n). The codes constructed above are not close 
to this bound (note however that, except for small t, Hamming- 
type bounds are usually loose). 

Now let us use binary BCH codes in Theorem 
[T3l Starting with codes of length n' = 63, 255 we 
obtain rank modulation codes with the parameters 
(64, 36, 13), (64, 30, 19), (64, 24, 25), (64, 18, 51), (64, 16, 61), 
(64,10,85), (256,215,13), (256,207,19), (256,199,25), 
(256,191,33), etc. These codes are not so good for a 
small number of errors, but become better as their distance 
increases. 

Finally consider examples of codes constructed from The- 
orem Q~4] As our seed codes we consider the following 
possibilities: products of Reed-Solomon codes and codes on 
Hermitian curves. 

Let us take C = A ® B, where A[15, 9, 7] and B[U, 3, 12] 
are Reed-Solomon codes over F 16 . Then the code C has length 
ri = 14 • 15 = 210, (so / = 8), cardinality 16 27 = 2 108 and 
distance 84, so t — 41. From Theorem [T4l we obtain a rank 
modulation code C T with the parameters (n = 255, log M = 
108, d > 107). Some further sets of parameters for codes of 
length n = 255 obtained as we vary dim(B) are as follows: 

dim(B) 4 5 6 7 8 
logM 144 180 216 252 288 
d 95 79 67 55 49 

The code parameters obtained for n — 255 are better than 
the parameters obtained for the same length in the above 
examples with binary BCH codes, although decoding product 
RS codes is somewhat more difficult than decoding BCH codes 
On the other hand, relying on product RS codes offers a great 
deal of flexibility in terms of the resulting parameters of rank 
modulation codes. 

We have seen above that Hermitian codes account for some 
of the best asymptotic code families when used in Theorem 
[l~4l They can also be used to obtain good finite-length rank 
modulation codes. To give an example, let C be a projective 
Hermitian code of length 4097 over F 2 s. We have dim(C) = 
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17a - 119, d(C) > 4097 - 17a for any integer a such that 
17 < a < 240; see p. 441]. Let us delete any 17 coordinates 
(puncture the code) to get a code C with 

n' = 4080 = 16(q- 1), 
dim(C') = dim(C), 
d H (C) > n'- 17a. 

We have n = n' + 3(q- 1) = 4845. For a e {60, . . . , 100} we 
obtain a suite of rank modulation codes with the parameters 
(n, 7208, 6119), (n, 7344, 6071), . . ., (n, 12648, 4079). 

As a final remark, note that most existing coding schemes 
for the Hamming space, binary or not, can be used in one or 
more of our constructions to produce rank modulation codes. 
The decoding complexity of the obtained codes essentially 
equals the decoding complexity of decoding the original codes 
for correcting Hamming errors or for low error probability. 
This includes codes for which the Hamming distance is not 
known or not relevant for the decoding performance, such as 
LDPC and polar coding schemes. In this case, the performance 
of rank modulation schemes should be studied by computer 
simulations, similarly to the analysis of the codes used as 
building elements in the constructions. 

V. Conclusion 

We have constructed a number of large classes of rank 
modulation codes, associating them with binary and g-ary 
codes in the Hamming space. If the latter codes possess 
efficient decoding algorithms, then the methods discussed 
above translate these algorithms to decoding algorithms of 
rank modulation codes of essentially the same complexity. 
Our constructions also afford simple encoding of the data 
into permutations which essentially reduces to the encoding 
of linear error-correcting codes in the Hamming space. Thus, 
the existing theory of error-correcting codes can be used to 
design practical error-correcting codes and procedures for the 
rank modulation scheme. 

A direction of research that has not been addressed in 
the literature including the present work, is to construct an 
adequate model of a probabilistic communication channel that 
is associated with the Kendall tau distance. We believe that the 
underpinnings of the channel model should be related to the 
process of charge dissipation of cells in flash memory devices. 
Once a reasonably simple probabilistic description of the error 
process is formally modelled, the next task will be to examine 
the performance on that channel of code families constructed 
in this work. 



[5] Y. Cassuto, M. Schwartz, V. Bohossian, and J. Bmck, "Codes for 
asymmetric limited-magnitude errors with application to multilevel flash 
memories," IEEE Trans. Inform. Theory, vol. 56, no. 4, pp. 1582-1595, 
2010. 

[6] H. Chadwick and L. Kurz, "Rank permutation group codes based on 

Kendall's correlation statistic," IEEE Trans. Inform. Theory, vol. 15, no. 

2, pp. 306-315, 1969. 
[7] W. Chu, C.J. Colbourn ad P. Dukes, "Constructions for permutation 

codes in powerline communications," Designs, Codes and Cryptography, 

vol. 32, pp. 51-64, 2004. 
[8] L. Comtet, Advanced Combinatorics, Dordrecht, Netherlands: Reidel, 

1974. 

[9] M. Deza and T. Huang, "Metrics on permutations, a survey," /. Comhin. 

Inform. System Set, vol. 23, no. 1-4, pp. 173-185, 1998. 
[10] P. Diaconis and R. L. Graham, "Spearman's footrule as a measure of 

disarray," Journal of the Royal Statistical Society, Series B, vol. 39, 

no. 2, pp. 262-268, 1977. 
[11] F. Gerth, III, "Limit probabilities for coranks of matrices over GF(q), 

Linear and Multilinear Algebra, vol. 19, no. 1, pp. 79-93, 1986. 
[12] A. Jiang, V. Bohossian, and J. Brack, "Rewriting codes for joint 

information storage in flash memories," IEEE Trans, on Inform. Theory, 

vol. 56, no. 10, pp. 5300-5313, 2010. 
[13] A. Jiang, M. Langberg, R. Mateescu, and J. Brack, "Data movement 

in flash memories," Proc. 46th Annual Allerton Conf. Commun. Control 

Comput., Monticello, IL, Sept. 2009, pp. 1031-1038. 
[14] A. Jiang, R. Mateescu, M. Schwartz, and J. Brack, "Rank modulation 

for flash memories," IEEE Trans. Inform. Theory, vol. 55, no. 6, pp. 

2659-2673, 2009. 

[15] A. Jiang, M. Schwartz, and J. Brack, "Correcting charge-constrained 

errors in the rank-modulation scheme," IEEE Trans. Inform. Theory, 

vol. 56, no. 5, pp. 2112-2120, 2010. 
[16] M. Kendall and J. D. Gibbons, Rank Correlation Methods, 5th ed., 

Edward Arnold Publishing, London, 1990. 
[17] D. E. Knuth, The Art of Computer Programming, Volume 3: Sorting and 

Searching, Reading. MA: Addison-Wesley, 1973. 
[18] J.C. Lagarias, and A. M. Odlyzko, "Effective versions of the Chebotarev 

density theorem." in Algebraic number fields: L-functions and Galois 

properties (Proc. Sympos., Univ. Durham, Durham, 1975), pp. 409^164. 

Academic Press, London, 1977. 
[19] R. Lidl and H. Niederreiter, Finite Fields, Addison-Wesley Publishing 

Company, Reading, Massachusetts, 1983. 
[20] T. Kl0ve, T.-T. Lin, S.-C. Tsai, and W.-G. Tzeng, "Permutation arrays 

under the Chebyshev distance," IEEE Trans. Inform. Theory, vol. 56, 

no. 6, pp. 2611-2617, 2010. 
[21] F. J. MacWilliams and N.J. A. Sloane, The Theory of Error-Correcting 

Codes, North-Holland, Amsterdam, 1991. 
[22] N. Papandreou, H. Pozidis, T. Mittelholzer, G. F. Close, M. Breitwisch, 

C. Lam, and E. Eleftheriou, "Drift-tolerant multilevel phase-change 

memory," 3rd IEEE International Memory Workshop (IMW), Monterrey, 

CA, May 22-25, 2011, 4pp. 
[23] M.-Z. Shieh and S.-C. Tsai, "Decoding frequency permutation arrays 

under Chebyshev distance," IEEE Trans. Inform. Theory, vol. 56, no. 

11, pp. 5730-5737, 2010. 
[24] I. Tamo and M. Schwartz, "Correcting limited magnitude errors in the 

rank modulation scheme," IEEE Trans. Inform. Theory, vol. 56, no. 6, 

pp. 2551-2560, 2010. 
[25] H. Vinck, J. Haering and T. Wadayama, "Coded M-PSK for power 

line communications," Proc, 2000 IEEE Int. Symp. Inform. Theory 

(ISIT2000), Sorrento, Italy, 2000, p. 137. 
[26] Y. Yehezkeally and M. Schwartz, "Snake-in-the-box codes for rank 

modulation," larXiv: 1 107.33721 2011. 
[27] F. Zhang, H. D. Pfister and A. Jiang, "LDPC codes for rank modulation 

in flash memories," Proc. 2010 IEEE Internal Sympos. Inform. Theory 

(ISIT 2010), Austin, TX, June 13-18, 2010, pp. 859-863. 



References 

[1] M. Abramowitz and I. A. Stegun, Handbook of Mathematical Functions, 
Dover, 1965. 

[2] N. Alon and J. Spencer, The Probabilistic Method, 3rd Ed., J. Wiley & 
Sons, 2008. 

[3] A. Barg, and A. Mazumdar, "Codes in permutation and error correction 
for rank modulation," IEEE Trans. Inform. Theory, vol. 56, no. 7, 
pp. 3158-3165, 2010. 

[4] R. Blahut, Algebraic Codes on Lines, Planes and Curves, Cambridge 
University Press, 2008. 



